A viewing and processing tool for the analysis of a comparable corpus of Kiranti mythology
نویسندگان
چکیده
This presentation describes a trilingual corpus of three endangered languages of the Kiranti group (Tibeto-Burman family) from Eastern Nepal. The languages, which are exclusively oral, share a rich mythology, and it is thus possible to build a corpus of the same native narrative material in the three languages. The segments of similar semantic content are tagged with a "similarity" label to identify correspondences among the three language versions of the story. An interface has been developed to allow these similarities to be viewed together, in order to allow make possible comparison of the different lexical and morphosyntactic features of each language. A concordancer makes it possible to see the various occurrences of words or glosses, and to further compare and contrast the languages.
منابع مشابه
Corpus based coreference resolution for Farsi text
"Coreference resolution" or "finding all expressions that refer to the same entity" in a text, is one of the important requirements in natural language processing. Two words are coreference when both refer to a single entity in the text or the real world. So the main task of coreference resolution systems is to identify terms that refer to a unique entity. A coreference resolution tool could be...
متن کاملآثار ارزیابی شناختی و سرکوبگری هیجانی بر واکنشهای عصبی خودکار بر اساس حساسیت پردازش حسی
Objectives The aim of this study was to evaluate the effect of emotion regulation strategies of cognitive appraisal and emotional suppression on autonomic nervous reactions based on high and low sensory processing sensitivity among students. Methods For this purpose, 500 students of Bu Ali Sina University of Hamadan were selected through a stratified sampling approach. Based on final score dis...
متن کاملHedges in English for Academic Purposes: A Corpus-based study of Iranian EFL learners
Hedges, as tools to express tentativeness and doubt, have been studied in plenty of research papers in the Iranian EFL research setting. However, their use in a learner corpus, portraying Iranian learner English, is in need of more research attention. With this end in view, this study aimed at investigating how Iranian EFL learners who have majored in English-related fields in Iran deployed hed...
متن کاملPsychological Analysis of Kiumarth Myth in the Light of the Personality Psychology of Jung
Mythology allocated a large part, fundamental and effectively to the human mind. The knowledge of mythology in fact recognizes the important infrastructure of ideas, culture and civilization. One of the most common ways to study mythology is to implement psychological ideas in mythology. The result is not only a better understanding of mythology, but also a better understanding of human psyche ...
متن کاملProducing a Persian Text Tokenizer Corpus Focusing on Its Computational Linguistics Considerations
The main task of the tokenization is to divide the sentences of the text into its constituent units and remove punctuation marks (dots, commas, etc.). Each unit is a continuous lexical or grammatical writing chain that is an independent semantic unit. Tokenization occurs at the word level and the extracted units can be used as input to other components such as stemmer. The requirement to create...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012